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Abstract 

'The  minimax  grid  matching  problem  is  a  fundamental  combinatorial  problem 
associated  with  the  average  case  analysis  of  algorithms.  The  problem  has 
arisen  in  a  number  of  interesting  and  seemingly  unrelated  areas,  including 
wafer-scale  integration  of  systolic  arrays,  two-dimensional  discrepancy 
problems,  and  testing  pseudorandom  number  generators.  However,  the  minimax 
grid  matching  problem  is  best  known  for  its  application  to  the  maximum  up¬ 
right  matching  problem.  The  maximum  up-right  matching  problem  was  originally 
defined  by  Karp,  Luby  and  Marche tti-Spaccamela  in  association  with  algorithms 
for  2-dimensional  bin  packing.  More  recently,  the  up-right  matching  problem 
has  arisen  in  the  average  case  analysis  of  on-line  algorithms  for 
1-dimensional  bin  packing  and  dynamic  allocation., 

In  this  paper,  solve  both  the  minimax  grid  matching  problem  and  the  maximum 
up-right  matching  problem.  As  a  direct  result,  we  obtain  tight  upper  bounds 
on  the  average  case  behavior  of  the  best  algorithms  known  for  2-dimensional 
bin  packing,  l-diraensional  on-line  packing  and  on-line  dynamic  allocation. 

The  results  also  solve  a  long-open  question  in  mathematical  statistics. 
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Abstract:  The  minimax  grid  matching  problem  is  a  fundamental  combinatorial  prob¬ 
lem  associated  with  the  average  case  analysis  of  algorithms.  The  problem  has  arisen  in  a 
number  of  interesting  and  seemingly  unrelated  areas,  including  wafer-scale  integration  of 
systolic  arrays,  two-dimensional  discrepancy  problems,  and  testing  pseudorandom  number 
generators.  However,  the  minimax  grid  matching  problem  is  best  known  for  its  application 
to  the  maximum  up-right  matching  problem.  The  maximum  up-right  matching  problem 
was  originally  defined  by  Karp,  Luby  and  Marchetti-Spaccamela  in  association  with  algo¬ 
rithms  for  2-dimensional  bin  packing.  More  recently,  the  up-right  matching  problem  has 
arisen  in  the  average  case  analysis  of  on-line  algorithms  for  1-dimensional  bin  packing  and 
dynamic  allocation. 

In  this  paper,  we  solve  both  the  minimax  grid  matching  problem  and  the  maximum  up¬ 
right  matching  problem.  As  a  direct  result,  we  obtain  tight  upper  bounds  on  the  average 
case  behavior  of  the  best  algorithms  known  for  2-dimensional  bin  packing,  1-dimensional 
on-line  packing  and  on-line  dynamic  allocation.  The  results  also  solve  a  long-open  question 
in  mathematical  statistics. 
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1.  Introduction 

Consider  a  square  with  area  N  in  the  plane  that  contains  N  grid  points  arranged 
in  a  regularly  spaced  v/7V  x  \/l^  array  and  N  random  points  located  independently  and  \ 

randomly  according  to  the  uniform  distribution  on  the  square.  For  example,  see  Figure  1. 
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Figure  1:  A  square  with  area  N  containing  a  regularly  spaced  y/N  x  y/N  grid  (denoted 
with  black  dots)  and  N  uniformly  distributed  random  points  (denoted  with  x’s). 


For  any  particular  set  of  N  random  points  P,  let  L[P)  denote  the  minimum  length 
such  that  there  exists  a  perfect  matching  of  the  (random)  points  in  P  to  the  grid  points  in 
the  square  for  which  the  distance  between  every  pair  of  matched  points  is  at  most  L{P). 
In  other  words,  L{P)  is  the  minimum  over  all  perfect  matchings  of  the  maximum  distance 
between  any  pair  of  matched  points,  or  more  simply  the  minimax  matching  length  for 
P.  As  an  example.  Figure  2  illustrates  two  matchings,  one  that  achieves  the  minimax 
matching  length  and  one  that  does  not. 
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Figiire  2:  Two  possible  matchings  for  a  set  of  random  points,  one  which  achieves  the 
mtmmax  matching  length  (b)  and  one  which  does  not  (a). 
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Algorithmically  speaking,  the  problem  of  computing  the  minimaix  matching  length  for 
.  any  set  P  is  relatively  straightforward,  and  is  not  of  concern  in  this  paper.  Nor  are  we 

interested  in  the  worst  case  value  of  L{P)  over  all  P,  which  is  trivially  Q{y/N).  Rather, 
we  are  interested  in  the  eipecfed  value  of  L(P)  for  random  P. 

Initially,  one  might  hope  that  the  expected  value  of  L{P)  is  a  constant  independent  of 
N  since  every  random  point  is  within  distance  \/2/2  of  a  grid  point.  With  high  probability, 
however,  the  same  is  not  true  for  every  grid  point.  In  particular,  it  is  not  difhcult  to  show 
that  with  probability  exceeding  1  —  ^  there  is  a  circular  region  with  0(logN)  area  in  the 
square  that  does  not  contain  any  random  points  at  all,  and  hence  L{P)  >  n(v/log  N)  with 
high  probability.  For  example,  see  Figure  3. 


Circle  of  area 
0(loKAf)  not 
containing  any 
random  points 


Figure  3:  Collection  of  N  random  points  P  for  which  there  is  a  circle  of  area  ©(logW) 
not  containing  any  random  points.  The  grid  point  (not  shown  here)  nearest  to  the  center 
of  this  circle  is  n(v^og  N)  away  from  every  random  point  and  hence,  L{P)  >  n(v/l3ii?) 
for  this  example. 

Although  the  minimax  grid  matching  problem  (that  of  determining  the  expected  value 
of  L{P)  for  random  P  ),  has  not  to  our  knowledge  been  directly  considered  before,  Leighton 
and  Leiserson  [13]  and  Ajtai,  Komlos  and  Tusnady  [l]  considered  similar  problems,  and 
developed  probabilistic  divide-and-conquer  techniques  that  show  L{P)  <  O(logN)  with 
high  probability.  The  resulting  ©(ylog  N)  gap  in  the  bounds  for  the  expected  value  of 
L{P)  remained  for  some  time  until  Shor  [21,  22]  proved  that  L[P)  >  n(log^^^  N)  with  very 
high  probability*  (i.e.,  with  probability  exceeding  1  —  l/W“  where  a  =  fl(\/log  N)).  Shor’s 
result  was  substantially  more  difficult  than  the  rather  elementary  n(\/log  N)  bound,  and 
suggested  that  the  minimax  grid  matching  problem  possessed  a  deeper  structure  than  one 
might  initially  realize. 

In  this  paper,  we  complete  the  asymptotic  analysis  of  the  minimax  grid  matching 
problem  by  proving  that  L[P)  <  0{\o^^*  N)  with  very  high  probability.  Hence  L(P}  = 
©(log®*^^  N)  with  very  high  probability,  and  the  expected  value  of  L(P)  is  ©(log*^^  N). 

Aside  from  being  an  interesting  combinatorial  problem,  the  improved  upper  bound  for 

*  Henceforth,  we  will  reserve  the  phrase  “with  high  probability”  to  mean  “with  probability  exceeding 
1  -  l/A**  for  any  constant  a  >  1,”  and  the  phrase  “with  very  high  probability”  to  mean  “with  probability 
exceeding  1  —  l/A"  where  a  =  flfv/log  N). 
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typical  values  of  L{P)  hcis  important  implications  for  the  average  case  behavior  of  a  wide 
variety  of  algorithms.  For  example,  as  a  direct  consequence  of  this  result,  we  now  know  that 
the  1-dimensional  Best  Fit  bin  packing  algorithm  wastes  Q{y/N  N)  space  with  very 
high  probability  when  packing  N  items  with  sizes  determined  by  identical  and  independent 
distributions  that  are  symmetric  about  1/2  in  [0,1].  Similar  consequences  hold  for  the 
Karp-Luby-Marchetti-Spaccamela  2-dimensional  bin  packing  algorithm  and  the  Coffman- 
Leighton  Best  Fit  Aligned  dynamic  allocation  algorithm.  Our  result  also  has  implications 
for  the  mzLximum  up-right  matching  problem,  a  long  open  discrepancy  problem,  wafer-scale 
integration  of  systolic  arrays,  and  potentially  for  testing  pseudorandom  number  generators. 

Establishing  the  connection  between  the  minimax  grid  matching  problem  and  the  wide 
variety  of  seemingly  unrelated  problems  just  mentioned  hzis,  of  course,  required  a  great  deal 
of  work  by  many  researchers.  Hence,  it  is  not  possible  to  provide  detailed  explanations  for 
all  of  the  connections  in  this  paper.  Instead,  we  include  only  the  relevant  definitions  and 
statements  of  results  for  each  connection,  referring  the  interested  reader  to  other  sources 
for  details. 

The  remainder  of  the  paper  is  divided  into  six  sections.  In  Section  2,  we  describe  some 
related  combinatorial  problems.  In  particular,  we  discuss  the  maximum  up-right  matching 
problem  and  several  2-dimensional  discrepancy  problems.  In  Section  3,  we  discuss  the 
application  of  the  minimaix  grid  matching  problem  to  bin  packing,  dynamic  allocation, 
wafer-scale  integration  and  testing  pseudorandom  number  generators.  The  proof  of  the 
0(log*'^'*  N)  upper  bound  for  minimax  grid  matching  is  contained  in  Section  4.  We  conclude 
with  some  remarks,  acknowledgements  and  references  in  Sections  5-7. 


2.  Related  Combinatorial  Problems 

In  this  section,  we  discuss  several  combinatorial  problems  that  are  closely  related  to  the 
minimax  grid  matching  problem.  These  problems  are  interesting  in  their  own  right,  but  are 
included  here  mostly  for  their  role  in  the  applications  described  in  Section  3  and  the  proof 
contained  in  Section  4.  We  commence  with  the  maximum  up-right  matching  problem  in 
Section  2.1  and  finish  with  a  discussion  of  two-dimensional  discrepancy  problems  in  Section 
2.2. 

2.1.  The  Maximum  Up-Right  Matching  Problem 

Consider  a  square  with  area  N  in  the  plane  that  contains  N  random  pluses  and  N 
random  minuses  located  independently  and  randomly  according  to  the  uniform  distribution 
on  the  square.  Given  a  set  of  N  pluses  and  N  minuses  P^,  an  up-right  matching  on  P^ 
is  a  one-to-one  matching  of  pluses  to  minuses  such  that  every  plus  is  either  unmatched  or 
is  matched  to  a  single  minus  that  lies  above  and  to  the  right  of  the  plus.  A  maximum  up¬ 
right  matching  for  is  an  up-right  matching  which  minimizes  the  number  of  unmatched 
points.  For  example,  Figure  4  illustrates  two  up-right  matchings.  One  is  maximum,  the 
other  is  not. 

Let  U{P^)  denote  the  number  of  unmatched  pluses  in  a  maximum  up-right  matching 
for  P^.  For  example,  (/{P"^)  —  1  for  the  collection  of  pluses  and  minuses  shown  in  Figure 
4.  As  with  minimax  grid  matching,  the  algorithmic  problems  of  determining  UlP"^)  and 
finding  a  maximum  up-right  matching  for  a  particular  collection  of  pluses  and  minuses  are 
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Figure  4:  Two  up-right  matchings.  Only  the  matching  in  (b)  is  maximum. 

elementary  and  not  of  interest  here.  Rather,  we  are  interested  in  determining  the  expected 
and/or  high  probability  values  of  U{P^)  for  random  P^. 

It  is  not  difRcult  to  show  that  the  expected  value  of  U{P'^)  is  Cl{y/N).  Simply  consider 
the  pluses  and  minuses  that  lie  in  the  upper  half  of  the  square.  Since  pluses  in  the  upper 
half  of  the  square  can  only  be  matched  to  minuses  in  the  upper  half  of  the  square,  the 
number  of  upper-half  pluses  less  the  number  of  upper  half  minuses  is  always  a  lower  bound 
on  the  number  of  unmatched  pluses  in  a  maximum  up-right  matching.  The  numbers  of 
upper-half  pluses  and  minuses  are  governed  by  simple  binomial  distributions  with  mean 
N/2  and  variance  N/4.  Hence,  the  number  of  upper-half  pluses  will  exceed  the  number  of 
upper-half  minuses  by  at  least  with  some  constant  positive  probability.  Thus,  the 

expected  value  of  U(P^)  is  at  least  n(\/iV)* 

Karp,  Luby  and  Marchetti-Spaccamela  [10]  were  the  first  to  explicitly  define  the  mai- 
imum  up-right  matching  problem  (i.e.,  the  problem  of  determining  the  expected  value  of 
U[P^)  ).  By  applying  Hall’s  Theorem  [9]  and  analyzing  the  dual  problem,  they  showed 
that  U(P^)  <  0{y/N  log  N)  with  high  probability.  By  applying  an  elegant  argument  of 
Ajtai,  Komlos  and  Tusnady  [l],  they  also  showed  that  U{P^)  >  n(v/N\/Iog  N)  with  high 
probability. 

The  resulting  0(y1og  N)  gap  in  the  bounds  remained  until  Shor  [21,  22]  proved  that 
UiP^)  >  n(\/N  log^^*  N)  with  very  high  probability.  Not  surprisingly,  this  result  combined 
with  the  0{y/N  log  N)  upper  bound  led  to  the  conjecture  that  U[P^)  =  Q{\^N  log  N) 
with  high  probability.  As  it  turns  out,  however,  this  is  not  the  case.  Instead,  it  is  the 
Cl{y/N  log^'*  N)  lower  bound  that  is  correct.  In  fact,  we  prove  in  this  paper  that  U[P*')  = 
0(V^log*^*  TV)  with  very  high  probability. 

Judging  from  the  similarity  between  the  claimed  bounds  for  the  minimax  grid  matching 
problem  and  the  maximum  up-right  matching  problem,  it  should  not  be  surprising  that 
the  two  problems  are  closely  related.  In  fact,  any  very  high  probability  upper  bound  on 
L{P)  can  be  transformed  into  a  very  high  probability  upper  bound  for  U[P^)  by  simply 
multiplying  by  0(\/]V)-  To  see  this,  consider  an  up-right  matching  problem  with  N  random 
pluses  and  N  random  minuses  P~ .  (Here  P"^  —  P'^  \J  P~ .)  Let  d{N)  be  a  very  high 


probability  upper  bound  on  L{P).  Then,  with  very  high  probability,  L{P'^)  <  d{N)  and 
L{P~)  <  d{N).  In  other  words,  the  pluses  and  minuses  can  each  be  matched  to  N  regularly 
spaced  grid  points  so  that  each  grid  point  is  matched  to  a  plus  and  a  minus  that  are  within 
distance  at  most  d[N).  Next  form  a  matching  on  P^  by  matching  the  plus  identified 
with  grid  point  (i,j)  to  the  minus  identified  with  grid  point  (;  +  2d{N),j  -h  2d(N))  for 
{(*«  j)|l  <  *  <  —  2d(N),  1  <  j  <  y/N  —  2d{N)}.  For  example,  see  Figure  5. 


Figure  5:  The  matching  of  a  plus  to  a  minus  via  association  with  grid  points.  Since  the 
solid  matching  lines  have  distance  at  most  d{N),  the  minus  is  above  and  to  the  right  of  the 
plus. 

From  inspection  of  Figure  5,  it  is  clear  that  the  procedure  just  described  forms  an  up¬ 
right  matching.  The  only  pluses  not  matched  by  the  procedure  are  those  identified  with 
grid  points  in  the  topmost  2d{N)  rows  and  the  rightmost  2d{N)  columns.  As  there  are  less 
than  4d{N)y/N  such  points,  we  can  conclude  that  U{P^)  <  0{y/N d[N))  with  very  high 
probability.  Hence  the  claimed  very  high  probability  0{\/N  N)  bound  for  U{P^) 
immediately  follows  from  the  very  high  probability  0(log^'^^  iV)  bound  for  L[P)  proved  in 
Section  4. 

It  is  conceivable  that  a  symmetric  condition  is  also  true:  namely,  that  a  very  high 
probability  upper  bound  for  U(P^)  can  be  easily  transformed  into  a  very  high  probability 
upper  bound  for  L{P).  We  doubt  that  this  is  the  case,  however,  since  the  minimax  grid 
matching  problem  appears  (at  lecist  to  us)  to  be  fundamentally  harder  and  more  general 
than  the  maximum  up-right  matching  problem*.  Nevertheless,  the  upper  bound  for 
U{P^)  obtained  from  L[P)  is  tight  up  to  constant  factors. 

2.2.  Two-Dimensional  Discrepancy  Problems 

Discrepancy  problems  have  a  long  and  rich  history  in  mathematics  and  statistics  [6,  19, 
23,  25].  In  fact,  much  of  probability  theory  and  mathematical  statistics  involves  estimating 

*  This  is  true  in  higher  dimensions.  In  d-diinensional  space,  for  d  >  3,  Karp  et  al.  [lO]  prove  that  for 
upi-right  matctiing,  the  log  factor  disappears  and  U{P^]  =  0(  whereas  there  is  an  easy  lower 

bound  of  n(log*^'*A)  for  d-dimensional  minimax  grid  matching.  Hence  tlie  relationship  between  V(P^) 
and  L{P]  breaks  down  in  higher  dimensions. 
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the  likelihood  of  certain  discrepancies  (or  deviations  from  the  norm)  occurring  in  random 
data.  As  a  familiar  example,  consider  an  unbiased  random  walk  of  length  N.  It  is  a  well- 
known  consequence  of  the  reflection  principal  that  the  largest  deviation  from  the  origin 
during  the  walk  is  expected  to  be  0(\/^).  Hence,  the  largest  “swing”  observed  in  any 
subwalk  of  a  walk  of  length  N  is  also  expected  to  be  0(V^). 

This  basic  result  has  many  consequences,  and  can  be  stated  more  generally  as  an 
example  of  a  1-dimensional  discrepancy  problem.  For  example,  consider  a  fixed  interval  of 
length  N  that  contains  N  random  points  selected  according  to  independent  and  identical 
uniform  distributions.  Define  the  discrepancy  of  any  subinterval  to  be  the  absolute  value 
of  the  difference  between  the  expected  number  of  points  contained  in  the  interval  (i.e.,  the 
length  of  the  interval)  and  the  actual  number  of  points  contained  in  the  interval.  Using 
the  reflection  principle,  it  is  not  difficult  to  show  that  the  expected  maximum  discrepancy 
over  all  subintervals  is  0(v^).  Moreover,  with  high  probability  every  subinterval  heis 
discrepancy  at  most  0(\/Zv/log  N  +  log  Af),  where  L  is  the  length  of  the  subinterval. 


In  this  paper,  we  are  interested  in  two-dimensional  generalizations  of  the  preceding 
one-dimensional  results  on  intervals.  Several  different  generalizations  have  been  studied, 
depending  on  how  the  subintervals  are  generalized.  For  example,  consider  a  square  with 
area  N  that  contains  N  random  points  generated  according  to  independent  and  identical 
uniform  distributions.  For  any  region  R  of  the  square,  define  the  discrepancy  of  the  region 
A{i2)  to  be  the  absolute  value  of  the  difference  between  the  expected  number  of  points 
contained  in  R  (i.e.,  the  area  of  i2)  and  the  actual  number  of  points  in  R.  In  a  classic  paper, 
Kiefer  (ill  showed  that  the  expected  maximum  discrepancy  of  any  oriented  rectangle  is 
0(V^)  for  fixed  N.  Generalizing  the  law  of  the  iterated  logarithm,  he  also  showed  that 


the  limsup  of  the  maximum  discrepancy  of  any  oriented  rectangle  over  all  N  oo  is 
Q{y/N \/log  log  N)  with  probability  1  —  o(l).  These  results  were  later  extended  by  Philipp 
[16]  who  proved  the  same  bounds  for  arbitrary  convex  regions,  and  by  Leighton  [12]  who 
showed  that  with  high  probability,  every  convex  region  has  discrepancy  0(\/Ay/log  N  -t- 
log  N)  where  A  is  the  area  of  the  region. 


Of  greatest  interest  in  this  paper,  however,  is  the  work  on  maximum  discrepancies  of 
up-right  regions  (also  known  as  lower  layers)  in  the  square.  Up-right  regions  are  defined 
by  monotone  decreasing  curves.  For  example,  Figure  6  illustrates  a  rectilinear  up-right 
region  with  N  —  S,  area  2  and  discrepancy  1. 


By  applying  Hall’s  Matching  Theorem  in  the  usual  way,  it  is  not  difficult  to  see  that  the 
up-right  discrepancy  problem  is  closely  related  to  the  up-right  matching  problem.  More 
precisely,  if  we  consider  a  square  with  area  N  that  contains  N  random  pluses  and  N 
random  minuses  P~ ,  and  we  define  A^{R)  to  be  the  number  of  pluses  in  R  less  the  number 
of  minuses  in  R,  then  the  maximum  value  of  A'^lR)  over  all  up-right  regions  is  precisely 
the  number  of  unmatched  pluses  in  a  maximum  up-right  matching  for  P^  =  P'^  U  P~. 
For  random  distributions,  the  maximum  values  of  A*(i2)  and  A{R)  are  within  constant 
factors  of  each  other  with  very  high  probability,  so  we  can  conclude  from  the  bounds  on 
U{P'^)  cited  in  Section  1  that  the  maximum  value  of  A(i?)  over  all  up-right  regions  is 
Q{'/N  \og^^*  N)  with  very  high  probability. 


Asymptotically  characterizing  the  expected  maximum  value  of  A{R)  over  all  up-right 
regions  has  been  an  open  problem  in  mathematical  statistics  for  some  time.  Perhaps  Blum 
[3]  was  the  first  to  study  the  problem,  proving  a  o(N)  upper  bound  in  1955.  This  was  later 
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Up-right 

region 


Figure  6:  An  up-right  region  with  N  =  8,  area  S  and  discrepancy  1. 

improved  to  by  Steele  [23]  in  1977  and  then  to  0{y/N  log  N {log  log  N)^)  for  some 

constant  a  by  Philipp  [17|.  The  first  nontrivial  lower  bound  was  proved  by  Dudley  [7]  in 
1982.  He  showed  that  the  expected  maximum  value  of  A(i2)  for  all  up-right  regions  is  at 
least  n(v/jV v/log  N/{log  log  N)  5+*)  for  any  constant  £  >  0.  Karp,  Luby  and  Marchetti- 
Spaccamela  [lOj  were  the  first  to  bring  the  problem  into  the  computer  science  community, 
proving  an  upper  bound  of  0{y/N  log  N)  and  (using  an  elegant  argument  of  Ajtai,  Komlos 
and  Tusnady  [ij)  a  lower  bound  of  fl(\/]Vi/log  N).  The  0{y/N  log  N)  upper  bound  also 
follows  directly  from  prior  work  of  Leighton  and  Leiserson  [13]  and  Ajtai,  Komlos  and 
Tusnady  (ij.  The  lower  bound  was  later  improved  to  Cl{y/N  log^^*  N)  by  Shor  [21,  22], 
who  independently  developed  a  method  similar  to  that  followed  by  Dudley.  The  results  in 
this  paper  provide  final  resolution  to  the  problem  by  establishing  a  very  high  probability 
0{^/N  log^'*  N)  upper  bound  on  the  maximum  discrepany  of  any  up-right  region. 

A  priori,  it  is  not  clear  why  we  should  restrict  ourselves  to  either  convex  or  up-right 
regions  of  the  square  when  generalizing  the  1-dimensional  discrepancy  problem.  For  ex¬ 
ample,  why  not  consider  any  simply  connected  region?  (A  region  is  said  to  be  simply 
connected  if  it  is  connected  and  contains  no  holes.)  The  answer,  of  course,  is  that  we  can 
always  construct  a  simply  connected  region  that  contains  all  N  points,  but  that  has  area 
arbitrarily  close  to  zero.  Hence,  the  worst  possible  discrepancy  is  always  achieved. 

Not  all  is  lost,  however,  since  the  discrepancy  of  a  simply  connected  region  can  be 
bounded  in  terms  of  its  perimeter.  In  fact,  we  will  prove  in  Section  4  that  with  very  high 
probability,  the  discrepancy  of  every  simply  connected  region  is  at  most  0(p  log^'^'*  iV  + 
log^^*  N),  where  p  is  the  perimeter  of  the  region.  Note  that  this  result  naturally  generalizes 
the  bound  on  discrepancies  of  up-right  regions  since  p  =  0{\/N)  for  any  up-right  region. 
More  importantly,  however,  the  result  will  be  sufficient  to  prove  the  O(log^^^iV)  upper 
bound  for  minimax  grid  matching.  The  proof  of  this  fact  is  not  difficult  and  again  uses 
Hall’s  Theorem.  The  details  are  deferred  until  Section  4. 

Although  the  topic  is  not  of  direct  concern  in  this  paper,  two-dimensional  discrepancy 
problems  have  also  been  studied  in  a  worst-ccise  setting.  The  most  notable  result  in  this 
area  is  due  to  Schmidt  [20]  who  resolved  a  decades  old  open  question  by  proving  that 
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no  matter  how  N  points  are  arranged  in  a  square  of  area  N ,  there  is  always  an  oriented 
rectangle  in  the  square  which  contains  n(log  iV)  fewer  or  n(logiV)  more  points  than  its 
area.  In  other  words,  there  is  always  an  oriented  rectangle  with  discrepancy  n(logN). 
Constructions  which  achieve  a  maximum  discrepancy  of  O(logA^)  for  this  problem  were 
known  long  ago  (14,  24]  and  have  recently  been  rediscovered  in  the  computer  science 
literature  [8,  15|.  Curiously,  it  is  still  not  known  how  to  prove  either  the  “n(log7V)  fewer” 
or  the  “n(logiV)  more”  result  individually.  Even  more  startling  is  the  fact  that  there  is 
always  an  oriented  right  triangle  in  the  square  with  discrepancy  [19|! 


3.  Applications 

In  this  section,  we  briefly  describe  the  applications  of  our  work  to  problems  involving 
bin  packing,  dynamic  allocation,  wafer-scale  integration,  and  testing  pseudorandom  num¬ 
ber  generators.  The  applications  range  widely  in  difficulty,  and  some  are  quite  elegant. 
Due  to  space  limitations,  however,  we  will  mostly  just  refer  the  reader  to  relevant  papers 
in  the  literature. 


3.1.  Wafer-Scale  Integration 

The  minimax  grid  matching  problem  was  first  studied  in  the  context  of  wafer-scale 
integration  of  two-dimensional  systolic  arrays  [13].  Only  later  wzus  its  association  with 
up-right  matching  noticed. 

When  constructing  two-dimensional  arrays  on  a  single  wafer,  the  designer  is  typically 
presented  with  a  regular  y/N  x  y/N  array  of  chip-size  cells,  some  fraction  of  which  are 
functional.  The  task  is  to  connect  the  functional  cells  into  a  smaller  square  array  in  a  way 
that  minimizes  the  length  of  the  longest  wire  needed  to  connect  adjacent  cells  in  the  array. 
For  example.  Figure  7  illustrates  the  connection  of  a  3  x  3  array  of  functional  cells  from  a 
4x4  array  containing  9  functional  cells. 


Defective 

cell 


Figure  7:  Connection  o/  o  3  x  3  array  of  functional  cells. 

Unfortunately,  the  designer’s  task  is  iVF-hard  in  general.  However,  the  impact  of  this 
difficulty  is  mitigated  by  two  practical  considerations.  First,  the  defective  cells  tend  to 
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be  located  randomly.  Second,  an  algorithm  need  only  work  well  for  most  (not  all)  wafers. 
Hence,  an  algorithm  which  works  with  high  probability  for  randomly  located  faults  is  very 
acceptable  in  practice. 

The  minimax  grid  matching  problem  suggests  exactly  such  an  algorithm.  First  match 
the  functioning  cells  one-to-one  with  an  imaginary  regularly  spaced  grid  of  the  same  car¬ 
dinality  in  a  way  that  minimizes  the  longest  matching  length.  By  the  work  in  this  paper, 
this  length  will  be  0(log^'^'‘ //)  with  very  high  probability. 

The  one-to-one  association  of  functioning  cells  with  imaginary  grid  points  assigns  a 
label  (i,  j)  to  each  functioning  cell.  It  remains  only  to  hook  up  cells  whose  labels  differ 
by  one  in  precisely  one  coordinate.  Since  the  distance  between  any  two  adjacent  cells  (say 
(i,j)  and  (i,j  -f-1),  for  example)  is  at  most  the  distance  between  (t,j)  and  its  matching  grid 
point  (this  is  0(log^^*  N)  with  very  high  probability)  plus  the  distance  between  (t,j  +  1) 
and  its  matching  grid  point  (also  0(log^^^  N)  with  very  high  probability)  plus  the  distance 
between  the  two  matching  grid  points  (constant),  every  wire  will  have  length  0(log^^^  A^) 
with  very  high  probability. 

Of  course,  this  analysis  assumes  nearly  widthless  wires,  but  for  some  applications  this 
is  reasonable.  Previously,  the  best  upper  bound  proved  for  this  problem  was  O  (log  TV)  by 
Leighton  and  Leiserson  [13].  The  best  lower  bound  known  remains  the  trivial  bound  of 

n(v/i^). 


3.2.  Two-Dimensional  Bin  Packing 

The  maximum  up-right  matching  problem  was  first  defined  in  the  context  of  two- 
dimensional  bin  packing  [lOj.  In  fact,  the  algorithm  proposed  by  Karp,  Luby  and  Marchetti- 
Spaccamela  for  two-dimensional  bin  packing  specifically  uses  the  maximum  up-right  match¬ 
ing  algorithm  as  a  subroutine.  In  what  follows,  we  briefly  describe  the  problem  and  the 
results.  We  refer  the  interested  reader  to  [lOj  for  a  description  of  the  algorithm  and  the 
details  of  the  analysis. 

Given  a  collection  of  N  items  from  [0,1]  x  [0,  l|,  the  two-dimensional  bin  packing 
problem  is  to  pack  the  N  items  into  the  minimum  possible  number  of  unit-square  bins. 
Of  course,  the  two-dimensional  bin  packing  problem  is  A'F-complete.  However,  when 
the  N  items  are  chosen  independently  and  uniformly  from  [O,  l]  x  [0,l|,  the  Karp-Luby- 
Marchetti-Spaccamela  algorithm  uses  j  -|-  ©(W)  bins  and  wastes  0(VT)  space  with  very 
high  probability,  where  W  =  Q{y/N  \og^^*  N)  is  the  anticipated  number  of  unmatched 
pluses  in  a  random  maiximum  up-right  matching  problem.  This  compares  favorably  with 
the  n(\//V)  lower  bound  on  expected  wasted  space  for  any  two-dimensional  bin  packing 
algorithm.  Whether  or  not  there  is  an  algorithm  which  achieves  o{\/N  \og^^*  N)  expected 
wasted  space  remains  unknown. 


3.3.  One-Dimensional  Din  Packing 

Somewhat  surprisingly,  the  apparently  two-dimensional  up-right  matching  problem 
also  arises  in  the  context  of  one-dimensional  bin  packing.  In  particular,  Shor  [21,  22]  showed 
that  the  space  wasted  by  the  Best  Fit  bin  packing  heuristic  when  packing  N  items  uniformly 
selected  from  [0,l|  is  ©(IT)  with  high  probability,  where  again  W  =  Q{y/N  \og^^*  N)  is 
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the  anticipated  number  of  unmatched  pluses  in  a  random  maximum  up-right  matching  ' 

problem.  Moreover,  the  same  upper  bound  holds  when  the  items  are  selected  from  any 
distribution  on  [0,1]  that  is  symmetric  about  1/2. 

The  Best  Fit  algorithm  is  one  of  the  most  common  on-line  algorithms  used  in  practice.  I 

The  algorithm  packs  each  item  as  it  “arrives”  (never  looking  ahead  at  subsequent  items)  in 
the  fullest  bin  into  which  it  fits.  Although  better  off-line  algorithms  are  known  (First  Fit 
Decreasing  achieves  the  optimal  bound  of  Q{\/N)  expected  wasted  space  [2]),  no  better  on¬ 
line  algorithm  is  known.  In  fact,  Shor  [21,  22]  used  a  related  matching  problem  to  prove 
an  Cl{y/N >/log  N)  lower  bound  on  the  expected  weisted  space  of  any  on-line  algorithm,  ! 

assuming  that  the  number  of  items  N  is  not  known  in  advance. 


3.4  Dynamic  Allocation 

The  up-right  matching  problem  also  arises  in  the  context  of  dynamic  algorithms  for 
file  storage.  In  particular,  the  Coffman-Leighton  [4]  Best-Fit-Aligned  (BFA)  algorithm  for 
dynamic  allocation  wastes  Q(W)  space  with  very  high  probability  where  W  is  the  expected 
number  of  unmatched  pluses  in  an  iV-point  random  up-right  matching  problem,  when  files 
arrive  and  depart  according  to  a  Poisson  process  where  N  is  the  expected  number  of  files 
in  storage  at  any  point  in  time.  Since  the  amount  of  used  (i.e.,  occupied)  space  in  storage 
at  any  time  is  easily  bounded,  this  means  that  the  capacity  of  the  storage  device  can  be 
predetermined  so  as  to  almost  never  waste  more  than  0{y/N  \og^^*  N)  space  and  so  cis  to 
almost  never  need  compaction.  Moreover,  the  results  hold  for  any  distribution  of  file  sizes 
in  [0,l[.  These  results  are  much  better  than  those  for  the  worst-case  algorithms  used  in 
practice,  which  commonly  waste  Q{N)  space  and  spend  a  constant  fraction  of  their  time 
in  amortized  compaction. 

Although,  we  do  not  have  room  to  describe  the  BFA  algorithm  in  detail  here,  we  can 
mention  a  simpler  balls  and  boxes  problem  with  related  performance.  Consider  an  infinite 
collection  of  boxes  in  a  line  1,2,3,....  Assume  initially  that  precisely  the  first  N  boxes 
are  filled.  At  each  step,  remove  one  of  the  balls  at  random,  and  then  try  to  insert  a  new 
ball  into  one  of  the  first  N  boxes  (chosen  at  random).  Boxes  can  contain  at  most  one  ball 
each,  so  if  we  are  unable  to  insert  the  new  ball  into  its  desired  box,  then  insert  the  ball  into 
the  leftmost  empty  box  to  the  right  of  the  originally  desired  box.  Continue  this  process 
indefinitely. 

It  is  clear  that  the  system  just  described  always  contains  exactly  N  balls,  but  the 
position  of  the  rightmost  ball  will  vary  with  time.  The  question  of  interest  in  dynamic 
allocation  involves  the  expected  location  of  the  rightmost  ball.  This  is,  of  course,  identical 
to  N  plus  the  number  of  empty  boxes  before  the  rightmost  ball  (i.e.,  the  wasted  space). 
By  a  rather  complicated  reduction  to  maximum  up-right  matching,  Coffman  and  Leighton 
[4]  showed  that  the  rightmost  position  at  any  time  T  >  N  \og  N  is  N  +  Q{\/N  \og^^*  N) 
with  very  high  probability.  Without  up-right  matching  to  rely  upon,  this  result  might  well 
have  been  impossible  to  obtain. 


3.5  Testing  Pseudorandom  Number  Generators 

All  of  the  applications  discussed  thus  far  are  directly  tied  to  one  of  the  matching 
problems.  The  corresponding  discrepancy  problems  also  have  applications,  but  most  are 
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to  problems  in  mathematical  statistics  (e.g.,  see  [23,  26])  and  are  of  limited  interest  to 
computer  scientists.  There  is  one  potential  application  that  could  be  of  interest,  however: 
using  the  minimax  grid  matching  problem  to  test  pseudorandom  number  generators.  As 
an  example,  consider  the  plot  of  “random  numbers”  from  the  IBM  PC  random  number 
generator  shown  in  Figure  8. 


Figure  8;  Plot  of  2000  random  pairs  (ii,Zj+i),  (i<+4, ...  generated  from  the  IBM 
PC  Random  Number  Generator  with  IBM  PC  Basie.  (Taken  from  [l8j.) 

The  points  in  Figure  8  (which  is  copied  from  (18|)  were  obtained  by  taking  consecutive 
pairs  of  “random  numbers"  and  using  them  as  x  and  y  coordinates.  In  this  case,  it  is 
readily  apparent  that  the  data  are  not  random.  One  way  of  detecting  this  formally  is 
to  observe  that  there  are  simply  connected  regions  with  area  A  and  perimeter  p  that 
contain  far  more  than  A  +  Q{p\og^^*  N)  points.  (Just  draw  the  boundary  around  one  of 
the  diagonal  clusters.)  Hence,  by  the  arguments  of  Sections  2.2  and  4,  the  minimax  grid 
matching  length  for  this  set  of  points  would  be  far  in  excess  of  ©(log*'^*  yv).  Hence,  we 
would  have  formal  grounds  for  deciding  that  the  IBM  PC  random  number  generator  is 
faulty. 

There  are  several  good  reasons  to  use  the  minimax  grid  matching  length  as  a  barom¬ 
eter  for  randomness.  First,  it  is  a  polynomial  time  test  which  in  some  sense  is  doing  an 
exponential  amount  of  work.  Because  of  Hall’s  Theorem,  we  know  that  if  the  minimax 
matching  length  is  d,  then  every  simply  connected  region  in  the  square  (of  which  there  are 
more  than  an  exponential  number)  is  verified  to  contain  about  the  right  number  of  random 
points  (i.e.,  between  A  —  0(pd)  and  A  -I-  Q{pd)).  Hence  large  discrepancies  in  odd  shaped 
regions  will  be  readily  detected,  even  if  they  are  not  easily  characterizable  or  readily  appar¬ 
ent  to  the  naked  eye.  Second,  the  test  will  also  identify  generators  that  are  “too  regular.” 
If  the  numbers  are  too  regular,  they  will  form  a  nearly  perfect  grid  resulting  in  a  minimax 
matching  length  that  is  too  small.  Third,  the  distribution  of  the  minimax  matching  length 
is  very  sharply  peaked  at  0(log*/^  N).  The  probability  of  deviating  from  the  peak  by  even 
a  constant  factor  is  vanishingly  (more  than  polynomially)  small.  Hence  small  deviations 
from  the  peak  would  provide  high  confidence  that  the  pseudorandom  number  generator  is 
not  random. 


The  major  drawback  to  using  the  minimax  matching  length  as  a  statistical  test  is  that 
we  have  not  been  able  to  completely  characterize  its  distribution.  This  would  have  to  be 
done  in  order  for  the  test  to  be  usable  for  fixed  values  of  N.  In  the  mean  time,  the  test 
can  only  be  used  as  a  well-motivated  heuristic.  In  fact,  we  are  currently  doing  just  that. 
Some  students  at  MIT  are  in  the  process  of  coding  several  tests  and  pseudorandom  number 
generators  of  various  types  to  see  how  they  compare. 

The  idea  of  using  a  matching  problem  as  a  test  for  a  pseudorandom  number  generator 
is  not  completely  new.  The  same  idea  (but  for  a  different  problem)  motivated  the  work 
of  Ajtai,  Komlos  and  Tusnady  [l].  Unfortunately,  they  also  were  unable  to  completely 
characterize  the  distribution  around  the  peak.  More  generally,  related  methods  can  be 
found  in  the  literature  on  spectral  and  Fourier  tests  [5]. 

4.  Proofs 

In  this  section,  we  prove  that  the  minimax  grid  matching  length  L{P)  for  N  random 
points  P  is  0(log^/^  N)  with  very  high  probability.  The  section  is  divided  into  four  subsec¬ 
tions.  In  Section  4.1,  we  formally  convert  the  minimax  grid  matching  problem  into  a  dual 
discrepancy  problem  with  a  straightforward  application  of  Hall’s  Theorem.  We  prove  the 
necessary  bounds  for  the  discrepancy  problem  in  Section  4.3.  Section  4.2  provides  some 
intuition  and  motivation  for  the  rather  complicated  proof  in  Section  4.3,  and  gives  some 
insight  into  why  the  eventual  answer  is  ©(log’^^JV)  instead  of  ©(v^log  N)  or  ©(logW).  In 
Section  4.4,  we  extend  the  result  of  Section  4.3  to  arbitrary  regions  in  the  plane.  The 
result  is  not  needed  for  the  rest  of  the  paper,  but  is  mathematically  more  interesting  and 
natural. 

4.1  Conversion  to  the  Dual  Discrepancy  Problem 

In  the  next  few  subsections,  we  will  restrict  our  attention  to  simply  connected  regions 
with  a  special  rectilinear  form.  In  particular,  we  define  a  partition  F  of  the  square  with 
area  N  into  s  subsquares,  each  with  side  length  log’^^  N.  In  Section  4.3,  we  will  prove 
that  there  is  a  constant  c  >  0  such  that  with  very  high  probability,  the  discrepancy  of  every 
simply  connected  region  whose  boundary  lies  along  the  edges  of  F  is  at  most  cp  log^^^  N 
where  p  is  the  perimeter  of  the  region.  As  a  consequence,  we  can  conclude  that  with  very 
high  probability,  the  discrepancy  of  every  region  (not  necessarily  connected  or  simple) 
whose  boundary  lies  along  the  edges  of  F  is  at  most  cp  log^^^  N  where  p  is  the  perimeter 
of  the  region.  (This  corollary  is  easily  proved  by  decomposing  an  arbitrary  region  into  the 
sum  and  difference  of  simply  connected  regions.)  In  this  subsection,  we  show  how  to  use 
this  result  to  prove  that  the  minimax  grid  matching  length  L{P)  for  N  random  points  P 
is  at  most  d  =  N)  with  very  high  probability. 

By  Hall’s  Theorem  [9),  it  is  sufficient  to  show  that  for  every  set  4  C  P  with  x  random 
points,  there  are  at  least  x  grid  points  within  distance  d  oi  A.  To  do  this,  we  define  a 
coarser  partition  F'  of  the  square  into  ^  subsquares  with  side  length  ^  where  (for  now) 
d  =  8clog^^*  N.  For  any  subset  A  C.  P ,  let  the  region  R  consist  of  all  the  subsquares  in  F' 
containing  a  point  of  A.  Let  R'  be  the  slightly  larger  region  formed  from  R  by  adding  an 
isosceles  right  triangle  with  hypotenuse  ^  to  every  ^-length  segment  along  the  perimeter  of 


R.  For  example,  see  Figure  9.  (For  now,  we  overlook  the  special  case  when  the  boundary 
of  R  coincides  with  the  boundary  of  the  overall  square.) 


Figure  9:  The  extension  of  R  to  R'. 


By  construction,  every  grid  point  of  R'  is  within  distance  d  of  some  point  in  A.  Hence 
it  remains  only  to  show  that  the  number  of  grid  points  in  R*  is  at  least  as  large  as  the 
number  of  random  points  in  A.  This  can  be  accomplished  by  observing  that  the  number 
of  grid  points  in  R'  is  at  least  the  area  of  R'  which  is 


2 


=  Area(R)  +  Per(R)c  log®^^  N 

whereas 

\A\  <  ylrea(iZ)  +  A(i2) 

<  Area{R)  +  Per{R)c  log’^'*  N  . 


This  completes  the  proof.  (Accounting  for  the  czise  when  the  boundary  of  R  coincides 
with  the  boundary  of  the  square  only  multiplies  d  by  a  constant  factor.) 


4.2  Decomposition  Into  Triangles  —  The  Intuition 

In  Section  4.3,  we  will  prove  that  with  very  high  probability,  every  simply  connected 
region  whose  boundary  lies  along  the  edges  of  F  has  discrepancy  at  most  0(p  log^^^iV) 
where  p  is  the  perimeter  of  the  region.  In  what  follows  we  provide  some  intuition  for  the 
result  by  proving  a  much  simpler  result  with  the  same  log®'^^  n  bound.  This  simpler  result 
deals  with  the  decomposition  of  a  polygonal  region  into  triangles  and  is  interesting  in  its 
own  right.  The  result  does  not  imply  the  desired  discrepancy  bound,  but  it  does  give  some 
of  the  basic  ideas  behind  it.  In  Section  4.3,  these  ideas  are  obscured  by  technical  details. 
Theorem  1:  Any  polygonal  region  R  with  n  vertices  and  perimeter  p  can  be  decomposed 
into  a  sum  and  difference  of  triangles  Tj  such  that  X),  \J Are a[Tj)  =  0(p log^'^^  n). 


Proof:  The  algorithm  we  use  to  decompose  the  region  into  a  sum  and  difference  of  triangles 
is  simple.  At  each  step,  we  reduce  the  number  of  vertices  of  the  polygon  by  half  by  cutting 
off  n/2  consecutive  triangles.  (See  Figure  10.)  In  particular,  we  cut  off  n/2  triangles 
formed  by  pairs  of  adjacent  sides.  We  concentrate  on  what  happens  when  we  remove  the 
yth  triangle,  which  is  formed,  say,  by  vertices  v,-i,  u,  and  y,>i.  Let  the  distance  between 
v,_i  and  u,+i  be  dy,  and  let  the  two  edges  and  VjU.+i  have  lengths  c,  and  Let 

Cj  =  e,  +  e,+i.  In  removing  the  triangle  we  remove  two  edges  totalling  length 

Cj,  and  add  an  edge  of  length  dj.  Thus,  we  reduce  the  perimeter  by  6j  =  Cj  —  dj. 


Figure  10;  Removing  four  triangles. 

We  must  show  that  this  algorithm  produces  a  decomposition  into  triangles  such  that 
Ylj  yj Area{Tj)  <  0{plog*^^n).  Removing  the  jth  triangle  reduces  the  perimeter  by  Sj  = 

Cj  —  dj.  The  area  of  this  triangle  is  at  most  -djyjc^  —  d*,  since  this  area  is  maximized  when 
~ 

We  therefore  get 

yjArea{Tj)  <  \d)''^(c]  -  d]y/* 

=  -  djY'* 

< 

We  repeat  this  step  of  cutting  off  half  the  vertices  until  we  reduce  the  number  of  sides 
to  2,  and  thus  have  no  area  left.  At  each  step  the  total  sum  of  the  Cj's  is  at  most  the 
perimeter  p,  so  since  there  are  logn  steps,  we  have  c,  <  plogn.  Since  6j  was  the 

change  in  the  perimeter  produced  by  removing  the  7th  triangle,  Sj  <  p.  Thus,  we 
have 

E”r?  iYT.Zl’J's]!' 

<2plog’'^^n. 

The  second  step  is  a  special  case  of  Holder’s  inequality.  I 


The  expected  discrepancy  of  a  triangle  T  is  Area{T)).  If  we  could  cissume  that 
the  average  discrepancy  of  a  triangle  in  the  decomposition  of  the  polygonal  region  R  was 
this  expected  discrepancy,  then  by  Theorem  1  the  total  discrepancy  of  the  region  R  would 
be  O(plog^^^n).  Proving  this  for  a  general  class  of  regions  R  is  the  most  difficult  part  of 
the  paper  and  is  done  in  Section  4.3. 

The  decomposition  used  in  the  proof  of  Theorem  1  is  basically  the  one  that  we  will  use 
for  the  general  result.  We  will  bound,  with  very  high  probability,  the  total  discrepancy  of 
all  the  n/2*  triangles  removed  at  the  kth  stage.  We  do  this  by  counting  the  number  of 
possible  sets  of  triangles  that  could  be  removed  at  the  kth  stage. 

Although  we  have  not  investigated  the  matter  formally,  the  bound  in  Theorem  I  is 
almost  surely  tight  in  the  worst  case. 

4.3  Formally  Bounding  the  Discrepancy  —  The  Proof 

In  this  section,  we  prove  the  following  central  result. 

Theorem  2:  Consider  a  set  of  N  points  uniformly  and  independently  distributed  in  the 
y/N  X  y/N  square,  and  define  P  to  be  a  grid  of  squares  with  edge  length  log®/'*  N.  Then  there 
is  a  constant  c  such  that  with  probability  at  least  1  —  every  simply  connected 

region  R  whose  boundary  follows  P  has  discrepancy  A{R)  <  cPer{R)  log®/*  N. 

The  proof  is  divided  into  two  sections,  one  deterministic  and  one  probabilistic.  In  the 
deterministic  section,  we  show  that  the  discrepancy  of  any  R  satisfying  the  hypothesis  can 
be  bounded  by  the  sum  of  the  discrepancies  of  2logp  disjoint  regions,  most  of  which  are 
drawn  from  “small  enough”  classes  of  “small  enough”  regions.  In  the  probabilistic  part  of  3 
the  proof,  we  establish  very  high  probability  bounds  on  the  discrepancies  of  the  regions  in 
the  classes,  thus  obtaining  an  upper  bound  on  the  discrepancy  of  any  R. 

4.3.1  The  Deterministic  Part  of  the  Proof 

The  deterministic  part  of  the  proof  of  Theorem  2  consists  of  showing  that  there  are 
collections  of  regions  Pi,,,p  of  the  y/N  x  y/N  square  and  a  constant  Cq  independent  of  N 
such  that: 

1)  the  area  of  every  region  in  is  at  most  2~'p^y/s, 

2)  the  number  of  regions  in  Di,,,p  is  at  most  7V*(s  log  ,  and 

3)  for  any  p  and  any  simply  connected  region  R  with  boundary  on  P  and  perimeter  p  in 
the  y/N  X  y/N  square,  there  exist  real  numbers  Si, . . . ,  and  regions  Di, . . . , 
and  D\,.. . ,  such  that 

3a)  Di,  D-  €  Di,ti,p  1  ^ 

3b)  A(fi:)  <  EZiiMD,)  +  A{D;)),  and 
3c)  Si  -h  •  •  +  5m  £  Co- 

The  Dt,s,p  are  defined  for  values  of  »  that  are  integers  in  the  range  [l,m],  where  m  = 
logp,  values  of  s  that  are  integer  multiples  of  1/  log  N  in  the  range  [1/  log  N,  A^|,  and  values 
of  p  that  are  integer  multiples  of  log®/*  N  in  the  range  [log®/*  N,  Nj.  In  fact,  we  will  assume 
without  loss  of  generality  that  p  is  a  power-of-two  multiple  of  log®/*  N. 


%  V 


The  regions  in  a  particular  need  not  be  connected  or  simple,  but  they  will  consist 
of  the  sum  and  difference  of  easily  described  polygons.  Very  roughly  speaking,  the  regions 
in  will  correspond  to  the  union  of  triangles  removed  in  the  (logn  —  t)th  halving 

operation  described  in  Section  4.2,  where  s  is  related  to  the  total  area  of  the  removed 
triangles  and  p  is  the  perimeter  of  the  original  region  being  decomposed.  The  precise 
characterization  of  the  P,  ,  p  is  fairly  difficult,  and  will  be  described  as  we  go  along. 

The  {Di}  and  {D,*}  for  a  region  R  are  derived  from  a  sequence  of  approximations 
i?!,..  .,Rm+i  to  R  where  i2i  =  0  and  Rm+i  =  R-  To  construct  the  sequence,  we  use  two 
different  sequences  of  piecewise  linear  closed  curves.  The  first  sequence  has  vertices  lying 
on  the  boundary  of  R.  We  call  the  ith  curve  of  this  kind  Ai.  The  Ai  are  then  approximated 
further  by  closed  curves  Bi  with  vertices  on  a  grid  G^.  If  Bi  is  a  simple  closed  curve,  then 
Ri  is  its  interior.  Although  we  begin  with  a  simple  closed  curve,  neither  the  A,  nor  the 
Bi  approximations  are  necessarily  simple  closed  curves,  which  causes  further  problems, 
forcing  us  to  define  a  region  “enclosed”  by  the  curve  B;.  This  “enclosed”  region  is  Ri, 
which  is  not  necessarily  connected,  even  though  R  is  connected. 

To  obtain  the  approximation  A,,  we  mark  2*  points  at  equal  distances  along  the  bound¬ 
ary  of  R.  We  call  these  points  Ujo,  an,  . . .,  The  starting  point  a,o  =  oq  is  the  same 

for  all  Ai  and  is  chosen  to  be  a  vertex  of  F.  We  then  join  these  points  in  order  by  edges. 
Half  the  vertices  of  Aj+i  are  also  vertices  of  A,,  specifically,  =  Ci+i^jy.  We  let  the  length 
of  the  edge  between  Uij-i  and  Oiy  be  tij. 

The  curve  Bi  is  obtained  by  approximating  the  curve  Aj  with  points  on  a  grid  Gi.  We 
let  the  jth  vertex  bij  of  Bi  be  the  nearest  grid  point  to  the  jth  vertex  of  Ai.  If  several 
grid  points  are  equidistant  to  some  vertex,  we  break  the  tie  arbitrarily. 

The  grid  Gi  has  points  spaced  evenly  at  distance  gt  =  p/(2'Vlogn).  The  grid  Gi+i  is 
a  refinement  of  Gi,  so  a  fourth  of  the  points  of  Gi+i  are  also  points  of  Gi.  We  denote  the 
edge  length  of  Gi  by  gi  =  gi(2*~^. 

We  have  now  produced  the  curve  Bi.  If  it  is  a  simple  closed  curve,  then  the  area  inside 
it  is  our  ith  approximation  Ri  to  the  region  R.  Otherwise,  we  must  do  some  more  work. 
If  an  area  is  enclosed  by  Bi  twice  or  more  (i.e.,  has  winding  number  >  2),  we  still  wish  to 
count  each  point  inside  it  at  most  once  when  calculating  the  discrepancy.  We  do  this  in 
the  following  manner.  If  the  winding  number  of  a  point  is  positive  with  respect  to  Bi,  we 
include  it  in  our  region.  If  the  winding  number  is  zero  or  negative,  we  do  not  include  it. 
This  gives  the  region  Ri  which  we  use  as  our  approximation  to  the  region  R. 

By  the  assumption  that  p  be  a  power-of-two  multiple  of  log®^"*  N,  we  know  that  A^+i 
coincides  with  the  boundary  of  R.  Since  the  vertices  of  F  are  a  subset  of  the  grid  points  in 
Gm+i,  this  means  that  Bm+i  is  identical  to  the  boundary  of  R  and  thus  Rm+i  =  R-  Since 
Bi  contains  only  2  vertices,  Ri  =  0.  Hence,  we  have  constructed  the  desired  sequence  of 
approximations  to  R.  The  {f?i}  and  {£>,*}  are  now  easily  defined  for  I  <  i  <  m  as  follows: 

Di  =  Ri+i/Ri  and 
D:  =  R,/R,+u 

where  the  notation  S1/S2  is  used  to  denote  the  set  of  points  contained  in  Si  but  not  in  S2. 

It  is  readily  observed  that 

A(Sj)  <  A(Si)  +  A(S,/S2)  -t-  A(S2/S,) 
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for  any  regions  Si  and  S2.  Thus  for  any  i, 

A{Ri+i)  ~  A{Ri)  <  A(A)  +  A(D;). 

Summing  over  1  <  *  <  m  and  using  Rm+i  =  R  and  Ri  =  0,  we  find  that 

MS)  <  fllMDi)  +  A(£>n|, 

i=l 


as  desired. 

Each  Di  and  Z?,*  is  contained  in  the  union  of  “triangles”  removed  from  Bj+i  to  produce 
Bi.  That  is  because  any  point  which  has  a  positive  winding  number  with  respect  to  Bj+i  but 
not  with  respect  to  Bi  must  be  contained  in  one  of  the  triangle-like  objects  removed  from 
Bi+i  to  form  B,-.  (The  objects  removed  from  Ai+i  to  form  A,  are  triangles,  albeit  possibly 
overlapping,  but  slight  deformations  of  this  basic  structure  are  possible  when  passing  to 
Bj+i  and  Bi  because  of  the  coarseness  of  the  approximation.)  We  will  eventually  use  this 
fact  to  show  that  the  areas  of  the  Dj’s  and  D,*’s  are  small.  Before  doing  this,  however,  we 
must  classify  D,  and  as  elements  of  Pi,t.p  for  some  s  and  p.  The  choice  of  p  is  easy;  it  is 
simply  the  perimeter  of  the  original  region  R.  Since  the  boundary  of  R  follows  F,  p  must 
be  an  integer  multiple  of  log*'^'*  N  in  the  range  (log®/'*  N,N\,  as  needed  for  the  definition  of 
In  fact,  we  can  increase  p  to  a  power-of-two  multiple  of  log®/*  N  by  appending  to  R 
a  zero-area  region  with  sufficiently  large  perimeter. 

The  selection  of  s  =  Sj  is  much  more  complicated  and  depends  on  the  triangle  decom¬ 
position  of  i2.  In  particular,  we  will  need  to  examine  the  triangles  Ti,  that  are  removed 
from  i4j+i  to  form  ^4,.  Specifically,  we  define  Tij  to  be  the  triangle  with  vertices  ai.y-i,  aij 
and  Recall  that  e,y  is  defined  to  be  the  distance  between  and  Uij  and  define 


2* 


P  7=1 


The  value  of  g,  is  a  normalized,  dimensionless  parameter  that  is  closely  related  to  the 
perimeter  of  ^4^  as  well  as  the  area  of  It  is  hard  to  provide  better  motivation  for 

this  definition  except  to  say  that  it  has  a  sufficiently  strong  form  so  that  the  inequalities 
will  all  work  out  right  later.  Unfortunately,  the  same  cannot  be  said  of  more  natural 
definitions.  The  following  fact,  in  particular,  will  prove  useful  later. 

Fact  1:  g,  <  g.+i  <  1  for  0  <  1  <  m  —  1. 

Proof:  To  show  that  <  1,  we  observe  that  the  2"*  edges  of  Am  each  have  length  at 
most  p/2"‘.  Thus 

9m  <  ^2”*(p/2"*)®  =  1. 

P 

To  show  that  qi  <  Qi+i,  we  apply  the  triangle  inequality  to  T,y  to  show  that  < 
2(e,-+i  2j~i  +  ®?+i  2;)-  Summing  over  1  <  j  <  2'  yields  that 

2*  2*'*'* 

E  <  2  E 

j=i  >=i 
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and  thus  that 


oi  i'  oi+l  2‘+‘ 

p  j=i  P  j=i 


which  means  that  9,-  <  I 

It  is  also  worthwhile  defining 

kij  =  2(e?^X  2,-1  +  Ci+|,2>)  ~ 

which  is  proportional  to  the  contribution  of  triangle  Tij  to  9,  ,  and 


—  ft+i  ~  Qi 

which  is  the  difference  in  q  for  A.+x  and  Ai.  By  Fact  1,  r,-  >  0  for  0  <  i  <  m  —  1  and 

m— 1 

^  =  9m  <  1. 

«=1 


Also  note  that 


V 

E 

i=i 


Lastly,  we  define 


*  ^  *'•-2 


+ 


2‘-i 


and  set  3,  to  be  the  smallest  integer  multiple  of  1/  log  N  that  is  greater  than  or  -  +/?/  log  AT 
where  a  and  /?  are  sufficiently  large  constants  to  be  determined  later.  (It  will  turn  out 
that  a  =  9eir  and  P  =  9c(l  -j-  167r)  are  sufficient  where  c  =  2.718....)  The  class  Pi  ,  p  is  then 
defined  to  be  the  set  of  all  Di  and  resulting  from  a  region  R  with  perimeter  p  for  which 
3  =  3^.  It  now  remains  to  prove  the  hypotheses  stated  at  the  beginning  of  this  section.  In 
particular,  we  still  must  show  that: 

1)  Arta{Di)  and  Area(Z7,*)  are  at  most  2“’p*y^, 

2)  the  number  of  Di  or  for  which  <  -Si  <  for  some  integer  t  is  at  most 
iV’f *•■"*,  and 

3)  Si  +  ••■  +  s,n  <  Co  where  co  is  a  constant  independent  of  N,  p  and  R. 

We  start  by  showing  that  the  areas  of  D,  and  D*  are  at  most  2~'p^y/si.  From  before, 
we  know  that  Di  and  D^  are  contained  in  the  union  of  the  triangle-like  objects  removed 
from  Bi+i  to  form  Bi.  In  particular,  define  Ei  =  (JyLx  define  Fi  to  be  the  set  of 

all  points  within  distance  \/2gi+i  of  a  point  in  Ei.  Then  Di,Di  C  Fi  and  it  is  sufficient 
to  show  Area{Fi)  <  2“*p*y^.  Since  Ei  can  be  bounded  with  two  closed  curves  each  with 
length  at  most  p,  it  is  easily  seen  that 


Area{Fi)  <  Area{Ei)  +  2\/2s^,+ip  -t-  27rg,?^i. 


The  area  of  Ei  is  bounded  by  the  sum  of  the  areas  of  the  triangles  Tj,  it  contains. 
The  area  of  T,j,  in  turn,  is  at  most  times  the  altitude  of  the  triangle.  A  simple 


geometric  argument  shows  that  the  altitude  is  at  most  where,  as  before,  fc,,  = 

Since  tij  <  p/2*,  this  means  that  Area{Tij)  <  pyJl^j/2*'*'^  and 

thus  that 

Area{Ei)  <  ^  \/^ 

^  ;=i 

2‘+2  ■ 

We  can  now  conclude  that 


Arca(Fi)  <  +  2\/2gj+,  + 

<  2-y  — 

\  4  plogW  2*‘*'Mog7V 


provided  only  that  ct  >  ^  and  0  >  f-  (Recall  that  =  p/2*'v/log  N  and  p  >  log^^^  N  since 
the  boundary  of  R  coincides  with  F.) 

We  next  show  that  the  number  of  Di  and  D*  for  which  <  s,  <  for  some  integer 

t  is  at  most  This  will  be  reasoned  with  a  Kolmogorov  complexity  type  of  argument. 
In  particular,  we  will  associate  every  set  of  sequential  approximations  (B,, . . . ,  for 

which  Si  <  with  a  unique  triple  of  lists.  The  first  list  will  consist  of  two  integers 
from  [l,fVj.  The  second  list  will  consist  of  2’‘''*  integers  drawn  from  [1,9].  The  third  list 
will  consist  of  2*'*'^  integers  that  sum  to  at  most  ^2''^^{t  —  0  +  16a),  where  a  and  0  are 
the  constants  in  the  definition  of  s*  =  ar'  +  0/  log  N .  This  will  be  sufficient  to  prove  the 
desired  result  since  the  number  of  such  triples  of  lists  is  at  most 


N’(9)“  '  V,. - 

=  -  0 +  I6a+ ^)\ 


provided  that  a  >  9en  and  0  >  16a -f  ^  >  9e(l  +  167r).  Notice  that  we  used  the  well-known 
inequality  <  (^)'^  in  the  third  step  of  the  calculation. 
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We  start  describing  the  association  of  {fli, . . . ,  B,>i}  with  lists  by  considering  the  ways 
that  Bi  can  be  extended  to  form  B,+i.  First  we  note  that  for  every  vertex  6,,  of  J9,,  the 
corresponding  vertex  6i+i,2;  of  5,>i  is  either  the  same  point  or  an  adjacent  point  of  the 
grid  Hence  there  are  9  possible  ways  to  refine  the  approximation  of  each  vertex  of 

Bi  in  Bi+i. 


21  10  14 

20  9  2  6  15 

•••  13  5  1  3  11  ••• 

19  8  4  7  16 

18  12  17 


Figure  11:  Numbering  the  grid  points. 

The  remaining  vertices  of  Bj+i  fall  “between”  consecutive  vertices  of  Bj.  To  specify  the 
location  of  vertex  we  label  all  the  grid  points  of  starting  with  the  midpoint 

of  the  edge  of  B,.  The  labelling  is  done  in  increasing  order  of  distance  from  the 

midpoint.  For  example,  see  Figure  11.  Since  the  edges  of  Ai+i  have  length  at  most 
the  next  vertex  of  B^+i  conceivably  could  be  as  far  as  p/2''^*  away  from  this  midpoint  (up  to 
the  0(gi+i)  error  induced  by  the  approximation).  This  is  essentially  y/logTf  grid  points  of 
Gi+i  away.  However,  most  points  will  be  considerably  closer,  resulting  in  a  much  narrower 
range  of  choices  for  most  We  will  prove  this  using  the  following  elementary  fact 

from  geometry. 

Fact  2:  Let  UV  denote  a  line  segment  and  let  \UV\  denote  its  length.  The  locus  of  points 
W  determined  by  2(|[/W|*  +  IVWI^)  —  IC/Fl*  =  fc  is  a  circle  of  radius  ^\/lc  about  the 
midpoint  of  UV . 

By  Fact  2  the  point  bi^i  2j-i  of  ^»+i  will  be  in  a  circle  of  radius  p  =  \ 
centered  on  the  midpoint  of  the  edge  bij-ibij.  (Recall  that  kij  —  2(e?^j  2j-i  +  27)  ~ 

and  that  B,^.,  is  an  approximation  to  Since  we  labelled  the  points  in  order  of 

increasing  distance  from  the  midpoint,  the  label  l,y  of  the  point  6, >1 27-1  will  be  less  than 
the  number  of  grid  points  in  that  circle.  The  number  of  grid  points  of  G.+i  in  a  circle  of 
radius  p  is  at  most  7r(p  +  v/2)*/3,>i •  It  follows  that  the  label  of  the  point  6j+i  27-1 

satisfies  the  equation 


Summing  this  inequality  for  /,y  above,  we  get  that 


+  167r2‘ 

i=i  y=i 


<  "  fP^Ui6.2* 

‘2ghi  V  2*  ) 

—  7r2*'''*r,  log  AT -h  167r2’. 

The  preceding  analysis  indicates  that  the  number  of  options  for  B,+i  given  B,  is  limited 
in  a  fundamental  way  by  r^.  In  particular  B.-n  can  be  completely  specified  by  B,,  r,, 
a  list  of  2*  numbers  from  (1,9),  and  a  list  of  2*  positive  integers  that  sum  to  at  most 
7r2''''*ri  log  +  167r2*.  Applying  this  process  recursively,  we  find  that  {i?2i  can 

be  completely  specified  by  Bi,  r|,  a  list  of  2*  +  2*”‘  +  .  . .  +  2  <  2‘^‘  numbers  from  [1,9], 
and  a  list  of  2‘^*  positive  integers  that  sum  to  at  most 

1 

X]{7r2‘'+‘ri- logiV  +  167r2‘')  <  7rr'2‘+Mog  iV  +  167r2‘+‘. 

i'  =  l 

If  we  restrict  our  attention  to  those  {Bi, . . . ,  B,+i}  for  which  s,  <  t/logi'/,  then  the 
preceding  sum  is  at  most 


Si  log  Af  -  /? 


+  167r2*+*  < 


{t  -  13  +  167r). 


Since  Bi  contains  only  two  points  from  C?i,  it  can  be  completely  specified  by  naming 
the  points.  Since  Gi  has  at  most  N  points,  this  is  accomplished  with  a  list  of  two  numbers 
from  Hence,  every  {Bi, . . . ,  B.+j}  for  which  can  be  uniquely  specified  by 

a  triple  of  lists:  one  with  two  numbers  from  [1,  Af),  one  with  2''^*  numbers  from  [1,9],  and 
one  with  2*^^  integers  that  sum  to  at  most  —  (3  +  IGtt).  As  argued  previously,  this 

means  that  the  number  of  D,  and  £),*  for  which  s,  <  is  at  most 

The  last  step  in  the  deterministic  part  of  the  proof  is  to  show  that  the  maximum 
of  si  +  •  •  •  +  s,n  over  all  regions  R  is  bounded  by  a  constant  independent  of  N  and  p. 
Fortunately,  this  is  easy  to  do  since 

p.  <- 

^  {0+l)m 

log  AT  ^ 


(/?  +  l)logp  ,  r, 


log  N 

m 

<  /3  +  1  +  a  5Z  2r> 

y=i 

<  p  +  l  +  2a 


;=1 i=j 


which  is  a  constant  independent  of  N  and  p. 


4.3.2  The  Probabilistic  Part  of  the  Proof 

The  probabilistic  part  of  the  proof  of  Theorem  2  consists  of  bounding  the  discrepancies 
of  the  regions  in  the  To  do  this,  we  will  make  use  of  the  following  simple  fact,  which 

can  be  proved  by  an  elementary  counting  argument. 

Fact  3:  Given  a  region  D  with  area  A  in  the  v/N  x  y/N  square,  the  probability  that 
A(Z5)  >  ^  is  at  most 

O  for  g  <  yi  and 

O  for  q>A, 

where  c\  is  a  constant  independent  of  D  and  N . 

In  order  to  bound  the  maximum  discrepancy  of  a  region  in  P,  ,  p  with  very  high  prob¬ 
ability,  it  is  sufficient  to  find  a  q  such  that 

and 

Iog(»logA/)^-ei4  ^  0^2“'°**^*^^ 

where  A  <  2~*p^%/s.  This  is  because  the  probability  that  a  fixed  region  in  P,,i,p  has 
discrepancy  q  or  greater  is  or  0(e~‘’**)  depending  on  whether  or  not  q  <  A, 

and  because  Pj  ,p  only  contains  at  most  jV*2*’^‘ regions. 

Solving  for  q  =  qi  \n  the  first  inequality  we  find  that  it  is  sufficient  for 

qi  =  O  ylllog^*^*  N  +  2’  log(s  log  N)\^ 

<  O  log^/'*  N  +  ps‘/yiog(slogfV)) 

<  O  {2~'^^p  log’^*  N  -t-  ps^^*  log(s  log  A  ))  . 

Solving  for  g  =  in  the  second  inequality,  we  find  that  it  is  sufficient  for 

g,  =  0(log"/*Af -f  2‘  Iog(.slogA^)) 

=  0(log*^^iV-f-2MoglogfV). 

Hence  with  very  high  probability,  the  maximum  discrepancy  of  a  region  in  D,  ,p  is  at  most 
g  =  +  g? 

=  0(2~'^^p\og^^*  N  +  ps^^*  log(slog  N)  +  log®^^  N  +  2'  log  log  N). 

Since  there  are  only  a  polynomial  number  of  combinations  of  i,  s  and  p,  we  can  conclude 
that  the  preceding  bound  holds  with  very  high  probability  for  all  P,,j,p.  From  Section  4.3.1, 
we  know  that  for  every  region  R, 

A(fl)<f:(A(D.)  +  A(D.-)) 


where  Di,  D*  6  A,..,p  for  1  <  i  <  m  and  Si  +  •  •  •  <  cq  for  some  constant  cq.  Thus, 

with  very  high  probability,  the  discrepancy  of  any  region  R  whose  boundary  follows  F  is 
at  most 


(logp 

log^'^*  N  +  ps]'*  log(5,  log  N)  +  log®^^  N  +  2'  log  log  N\ 

where  E!=V  <  cq. 

We  will  consider  the  impact  of  each  of  the  four  terms  in  the  sum  individually.  The  first 
term  in  the  sum  forms  a  geometric  series  that  sums  to  0(p  log^^'*  iV).  The  second  term  in 
the  sum  is  the  most  important  and  lies  at  the  heart  of  the  proof.  Because  log(x  log  A^) 
is  convex  in  x,  we  know  by  Jensen’s  inequality  that  the  value  of  log(5,  log  N)  is 

maximized  when  the  s,  are  all  equal.  Hence  these  terms  sum  to 

C)(log  A/’p(co/logiV)‘/^logCo)  =  ©(plog*''^  Af). 

The  third  term  in  the  sum  trivially  adds  up  to  0(logplog®^*  W)  which  is  0{p\o^^^  N) 
provided  that  p  >  log^^'*  N  log  log  N .  The  fourth  term  in  the  sum  forms  another  geometric 
series  that  converges  to  0(p  log  log  Af). 

The  preceding  argument  concludes  the  proof  of  Theorem  2  except  in  the  special  case 
when  p  <  log^*^^  A'^  log  log  A'^.  For  this  case,  we  can  forget  Section  4.3.1  and  apply  Fact  3 
directly. 

Since  such  regions  can  contain  at  most  (log  log  iV)*  interlinked  square  cells  of  F,  there 
are  at  most  0[N^)  such  regions.  Hence,  with  very  high  probability,  the  maximum  discrep¬ 
ancy  of  such  a  region  is  at  most  9  =  +  92  where 

<  o(2''°»’''^)  and 

Solving  the  equations  yields 


<7:  =  0(v/Ilog*/^N) 

<  0[p\oz^l*N) 

and 

92  -  0(log'/'A^) 

<  0(p\og^'*  N) 

since  p  >  log^^'*  iV  and  .4  <  O  p‘‘).  Thus,  with  very  high  probability,  the  discrepancy  of 
any  region  with  p  <  iog^^^  log  log  N  is  ^(p  log^'^^  N)  and  we  are  finally  done  with  the  proof 
of  Theorem  2! 

It  is  worth  remarking  that  the  probability  bound  in  Theorem  2  is  essentially  tight. 
For  example,  the  discrepancy  of  the  upper  half  of  the  \/ N  x  \/ N  square  will  exceed 
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ci\/N  N  with  probability  2“**'°**^*^  for  some  constant  ci  and  cj-  In  fact,  this  ex¬ 
ample  illustrates  the  relationship  between  the  constants  in  the  ©(plog®'^*  iV)  bound  and 
the  constant  in  the  exponent  of  the  very  high  probability  bound.  With  further  work,  it  is 
conceivable  that  the  exact  relationship  could  be  specified. 

4.4  Extension  to  Arbitrary  Regions 

In  what  follows,  we  show  how  to  extend  the  discrepancy  bound  in  Theorem  2  to 
arbitrary  simply  connected  regions.  The  result  is  not  essential  to  the  rest  of  the  paper 
(Theorem  2  is  sufficient),  but  it  is  more  natural  than  Theorem  2  and  may  eventually  prove 
to  be  equally  important. 

Theorem  3:  Consider  a  set  of  N  points  uniformly  and  independently  distributed  in  the 
y/N  X  y/N  square.  There  is  a  constant  c  such  that  with  probability  1  —  ,  every 

simply  connected  region  R  has  discrepancy  A{R)  <  cPer{R)  log^^^  N  +  c  log*^*  N. 

Proof:  The  proof  is  not  difficult  given  Theorem  2  so  we  will  only  sketch  the  argument 
here.  Divide  R  into  two  regions  R'  and  iE"  where  R'  consists  of  all  log*/^  N  x  log^^^  N 
square  subregions  of  F  that  are  entirely  contained  in  R  and  where  R"  =  R/R'.  Although 
R'  need  not  be  connected,  we  can  conclude  immediately  from  Theorem  2  that  A(iE')  < 
0(p' log*^^  N)  with  very  high  probability  where  p'  is  the  perimeter  of  R'.  By  elementary 
geometry,  p'  <  6p  where  p  is  the  perimeter  of  R  and  hence  A(iE')  <  0(plog®/^iV)  with 
very  high  probability.  Since  A(i2)  <  A{R')  +  A{R"),  it  remains  only  to  show  that  A{R")  < 
0{p  N  +  log^/^  N)  with  very  high  probability. 

With  very  high  probability,  every  square  subregion  of  F  contains  at  most  0(log®/*  N) 
random  points.  Since  any  square  containing  part  of  /E"  must  contain  part  of  the  simple 
closed  curve  boundary  JE,  at  most  0(1  -f  p/log®^*  N)  squares  can  contain  part  of  R".  Any 
such  square  can  contribute  at  most  0{\og^^*  N)  to  the  discrepancy  of  R"  since  all  squares 
have  0(log*^*  iV)  area  and  0(log*^*N)  random  points  with  very  high  probability.  Thus 
A{R")  <  0(log^^*  N  +  plog’^^  N)  with  very  high  probability,  as  claimed.  I 

It  is  worth  remarking  that  a  slightly  stronger  result  can  be  proved  if  the  very  high 
probability  assumption  is  relaxed.  In  particular,  with  high  probability,  every  region  with 
perimeter  p  has  discrepancy  at  most  0(Iog  N  +  plog®^^  N).  (This  can  be  proved  by  noting 
that  Theorem  3  handles  the  case  when  p  >  log®^^iV,  and  then  observing  that  there  are 
at  most  a  polynomial  number  of  regions  with  p  <  log^^^W.)  The  log^^^  W  factor  cannot 
be  improved  since  Shor  (22)  h2is  demonstrated  the  existence  of  regions  having  discrepancy 
n(plog^^*  N)  for  p  =  0(\/7V)  with  very  high  probability. 

5.  Remarks 

Upon  reflection,  it  really  should  not  be  surprising  that  the  minimax  grid  matching 
problem  arises  in  so  many  diverse  and  useful  applications.  After  all,  its  dual  discrepancy 
problem  captures  a  very  important  measure  of  expected  discrepancies  in  random  data.  In 
fact,  it  is  precisely  the  measure  that  is  important  to  the  analysis  of  many  algorithms. 

It  is  likely  that  other  applications  of  this  work  will  be  discovered,  and  even  more 
likely  still  that  other  matching  and/or  discrepancy  problems  will  arise  in  the  analysis 
of  algorithms.  One  such  problem  that  still  remains  unsolved  is  the  rightward  matching 


problem.  In  rightward  matching,  we  have  N  random  pluses  and  N  random  minuses 
in  a  unit  square,  and  are  asked  to  match  pluses  rightward  to  minuses  in  a  way  which 
minimizes  the  average  vertical  distance  V (P'^)  of  the  matching  edges.  (Unmatched  pluses 
are  considered  to  match  to  the  top  or  bottom  of  the  square.)  As  a  consequence  of  the 
minimax  grid  matching  result  in  this  paper,  we  can  conclude  that  V{P'^)  <  0{\og^^*  N) 
with  very  high  probability.  Improvement  of  the  lower  bound  of  n(ylog  N)  would  directly 
lead  to  improved  lower  bounds  on  the  expected  wasted  space  of  any  on-line  bin  packing 
algorithm  [21,  22]. 

Discrepancy  problems  are  still  of  interest  in  mathematical  statistics.  For  example, 
Dudley  [7]  leaves  open  the  question  concerning  the  maximum  expected  discrepancy  of  any 
convex  3-dimensional  region.  Curiously,  this  appears  to  be  analogous  to  the  2-dimensional 
up-right  discrepancy  problem  analyzed  in  this  paper.  We  suspect  that  the  bound  is  again 
0(v/^  log^'^'*  N]  but  have  not  proved  it. 

Multidimensional  matching  and  discrepancy  results  will  also  be  likely  to  have  conse¬ 
quences  for  multidimensional  packing  and  allocation  problems.  For  example,  see  [lOj. 
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